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ADAPTIVE HOSTED TEXT TO SPEECH PROCESSING 



FIELD OF THE INVENTION 

The present invention relates to speech processing and, more specifically to adaptive 
hosted text to speech processing. 

BACKGROUND OF THE INVENTION 
5 Humans learn information in a variety of ways. Two of the most common ways to 

learn information are reading text and listening to speech. In many situations, it is desirable 
to convert into audible speech information that is stored as written text. For example, a 
parent may read a bedtime story to a child. In certain applications, it is not practical to 
employ live humans to read a text out loud every time anyone wants to hear the information 

1 0 contained in the text. One approach for handling such situations is to record a human reading 
the text out loud, and then play back the recording every time someone wants to hear the 
information contained in the text. This approach is used, for example, to create audio 
recordings of books. 

Unfortunately, even creating recordings of texts is not practical for many 

1 5 applications. For example, a news company may desire to have all of its news stories 

available as audible speech as well as written text. However, the volume of news stories may 
make it impractical to have someone read and record all of them. The cost of recording the 
full-text readings becomes unpractically high in many modern applications, such as services 
that present as audible speech information from thousands or millions of electronic sources 

20 of textual information, such as web pages on the World Wide Web. 

For applications where full-text readings are impractical, it is possible to store partial- 
text readings and then combine the partial-texts readings during playback. For example, a 
human can record the reading of every word in a dictionary, and playback the single-word 
recordings in the sequence that the words appear in a text. However, this only works when 
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the reader can anticipate every word or phrase in the text. As a practical matter, it is 
impossible to pre-record all possible words and phrases without knowing the exact content of 
the texts involved. Thus, the partial-text reading technique works well when the content of 
all texts involved is known ahead of time, but does not work when it is not. 
5 When the exact content of texts is not known ahead of time, the text is said to contain 

"unanticipated content". One approach to providing text-to-speech service for texts that may 
contain unanticipated content involves the use of a "synthesized voice". A synthesized voice 
is produced by programming a device (not an actual human) to pronounce words contained 
within an input text based on a complex set of pronunciation rules. Unfortunately, even the 
1 0 most sophisticated voice synthesis techniques produce "readings" of notoriously poor quality 
that many listeners find unacceptable. 

Based on the foregoing, it is clearly desirable to provide improved text-to-speech 
techniques. In particular, it is desirable to provide improved text-to-speech techniques for 
situations in which the input texts may contain unanticipated content. 
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SUMMARY OF THE INVENTION 

Techniques are provided performing text-to-speech translation in situations in which 
the input texts may contain unanticipated content. According to one aspect of the invention, 
text-to-speech services are provided by splitting a text into segments that include anticipated- 
5 content segments and unanticipated-content segments. Speech for the anticipated-content 
segments is generated based on pre-recorded sound recordings that correspond to the 
anticipated-content segments. Speech for the unanticipated-content segments is generated 
using speech synthesis. 

According to another aspect of the invention, usage statistics are recorded. The usage 

1 0 statistics identify which segments are contained in texts that are translated using the text-to- 
speech services. In one embodiment, the usage statistics indicate frequency of use of 
unanticipated-content segments and, based on the usage statistics, a set of unanticipated- 
content segments for which to make recordings is selected. In another embodiment, the 
usage statistics indicate frequency of use of anticipated-content segments, and a set of 

15 anticipated-content segments is selected based on the usage statistics. The recordings 
associated with the selected anticipated-content segments are then removed. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example, and not by way of limitation, 
in the figures of the accompanying drawings and in which like reference numerals refer to 
similar elements and in which: 

FIG. 1 is a block diagram of a system configured according to an embodiment of the 
invention; and 

FIG. 2 is a block diagram of a computer system upon which embodiments of the 
invention may be implemented. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Techniques are provided performing text-to-speech translation in situations in which 
the input texts may contain unanticipated content. In the following description, for the 
purposes of explanation, numerous specific details are set forth in order to provide a 
5 thorough understanding of the present invention. It will be apparent, however, to one skilled 
in the art that the present invention may be practiced without these specific details. In other 
instances, well-known structures and devices are shown in block diagram form in order to 
avoid unnecessarily obscuring the present invention. 

SYSTEM OVERVIEW 

;a *_ 10 Referring to FIG. 1, it is a block diagram of a system 100 in which the techniques 

^ described herein may be employed. System 1 00 includes a plurality of text sources 1 02-1 08, 
: ^ a plurality of users 120-126, and a text-to-speech host 110. 

Text sources 1 02- 1 08 generally represent any type of source of any type of text. For 
^ example, in the context of the World Wide Web, text sources 1 02- 1 08 may be web pages that 
H45 include text. Alternatively, texts sources 102-108 may represent electronic versions of 
□ books. Text sources 1 02- 1 08 may be stored together and controlled by a single party, or may 
p be stored separately and controlled by many parties. The present invention is not limited to 
any particular type of textual source, or any particular storage or control arrangement. 

Text-to-speech host 1 10 generally represents the host of a service for providing users 
20 with audible speech of text sources 1 02- 1 08 . Text-to-speech host 1 1 0 may be the owner of 
text sources 102-108, or may be a third party completely separate from the owners and/or 
producers of text sources 102-108. For example, text sources 102-108 may represent text 
contained in web pages throughout the World Wide Web, while text-to-speech host 1 10 is a 
service, connected to the World Wide Web, that provides the services of converting to 
25 audible speech the text of web pages specified by users. 
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Users 120-126 generally represent the entities that desire audible speech versions of 
text sources 102-108. Users 120-126 maybe, for example, humans that place telephone calls 
to text-to-speech host 1 10 to have content contained in text sources 102-108 read to them 
over the telephone. Alternatively, users 120-126 may be computer processes that process 
5 speech input. Users 120-126 may also be humans that desire to have their email read to them 
over the telephone, where text sources 102-108 represent their email. The present invention 
is not limited to any particular type of audible speech recipient. 

FUNCTIONAL OVERVIEW 
According to one embodiment, text-to-speech host 110 employs a technique that 
10 combines the best of the partial-text recording and voice synthesis techniques described 
s 3 above. In particular, text-to-speech host 1 10 maintains pre-recorded content 130 of 
□ frequently used words and phrases. The pre-recorded content 130 may be maintained, for 
:= g example, as pre-recorded sound files in a database. 

jVj Whenever text-to-speech host 1 10 is asked to translate text to speech, host 110 splits 

; J 5 the text into segments. The resulting segments generally include anticipated-content 
I-.*-. segments and unanticipated-content segments. The anticipated-content segments are 

segments that correspond to pre-recorded content 130. The unanticipated-content segments 
1 are segments that have no corresponding pre-recorded content 130. 

After splitting the text input segments, text-to-speech host 110 translates the text to 
20 speech by playing back the pre-recorded content 1 30 for the anticipated-content segments, 
and converting the unanticipated-content segments to speech using voice synthesis 
techniques. 

ADAPTIVE TEXT-TO-SPEECH TECHNIQUES 
In general, pre-recorded speech is easier to understand and preferable to voice 
25 synthesis speech. Therefore, according to one aspect of the invention, text-to-speech host 
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110 employs adaptive techniques to increase the percentage of speech output that is covered 
by pre-recorded content 130. 

According to one embodiment, text-to-speech host 110 maintains usage statistics 140. 
Usage statistics 140 generally represent information about how users are using text-to-speech 
5 host 110. Usage statistics 140 may include, for example, data that identifies the 

unanticipated-content segments that have been translated within a particular time period, and 
the frequency at which each of the unanticipated-content segments was translated. 

According to one embodiment, a set of unanticipated-content segments is periodically 
selected based on the usage statistics 140. For example, the usage statistics 140 may be used 
1 0 to identify and select the unanticipated-content segments that were most frequently requested 
^ during the most recent time period. The unanticipated-content segments thus selected are 
^ then presented to a speech recorder ("voice"). The voice then records the words and/or 
; s 5 phrases that correspond to the selected unanticipated-content segments, and stores the 

recordings along with the existing pre-recorded content 130. 
M 5 Consequently, when those words or phrases are encountered in text that is 

M subsequently requested, those words and phrases will correspond to pre-recorded content 
□ 1 30, and therefore will be processed as anticipated-content segments rather than 
q unanticipated-content segments. Specifically, the text-to-speech host 1 10 will play back the 

newly-recorded sound files for those segments, rather than translating them using speech 
20 synthesis. 

This process may be repeated continuously, thereby constantly increasing the quality 
of the speech produced by text-to-speech host 110. For example, each morning a person may 
record the ten most frequently translated unanticipated-content segments of the previous day. 
Because the segments that are translated are those most frequently encountered, the relatively 
25 high-cost resource of human effort is used to its greatest efficacy. 
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DISCARDING PRE-RECORDED CONTENT 
According to one embodiment, usage statistics 130 are also used to determine pre- 
recorded content 1 30 to be discarded. For example, text-to-speech host 1 1 0 may record as 
part of usage statistics 140 the frequency with which pre-recorded content 1 30 is accessed. If 
5 the frequency with which a particular segment of pre-recorded content 130 drops below a 
predetermined level, the text-to-speech host 110 may automatically discard that segment, 
thus making available more storage space for new pre-recorded content 130. 

CONTENT-BASED SELECTION 
According to another aspect of the invention, more than one recording may be stored 

10 for a particular segment of text. For example, the word "cool" may correspond to two 
ji recordings, one that pronounces the word as is conventional in the context of temperature, the 
□ other of which pronounces the word as is conventional when used as slang. According to 
j* one embodiment, for each text segment that has more than one recording, rules are provided 
7% for selecting which recording to use in a given context. When the text-to-host 110 
" 1 5 encounters a segment for which there is more than one recording, the text-to-host 110 selects 

^ one of the recordings based on the rules associated with that segment, and uses the selected 
W recording to translate the segment to audible speech. 

Q The rules may select the appropriate recording, for example, based at least in part on 

the textual context in which the segment resides. For example, a rule may specify that the 
20 "temperature" pronunciation of the word "cool" is to be used when the word "cool" appears 
in a paragraph that also includes the word "temperature". 

Other factors that may be used to determine which recording to use may include, for 
example, the source of the text. Thus, if the source of the text is a weather service, than the 
"temperature" pronunciation of the word "cool" may be selected regardless of the words 
25 surrounding it. 
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NEWS SERVICE EXAMPLE 
The techniques described herein are particularly beneficial for environments in which 
a host performs text-to-speech translation for text that originates from a variety of outside 
sources. For example, text-to-speech host 110 may provide text-to-speech translations for a 
5 variety of news services. Due to the nature of current news, certain words that are rarely 
used may, for short periods of time, be used with a very high frequency. For example, the 
word "Kurst" is the name of a sunken Russian submarine. Prior to the sinking, the word 
would probably have never shown up in the text from the news sources. However, for the 
several weeks that followed the sinking, the word "Kurst" would appear in the news text with 
1 0 great frequency. 

«i Using the adaptive techniques described herein, the word "Kurst" would, shortly after 

the sinking, be selected as one of the most frequently encountered unanticipated-content 
;;!f segments. In response, a recording of the word would be stored with pre-recorded content 
;^ 130. Consequently, the pre-recording would be used in all subsequent text-to-speech 
45 translations of the word during the following weeks. Eventually, the Kurst would cease to be 
H ; mentioned in the news, and the recording of "Kurst" would be identified as a least frequently 
Q used recording. The "Kurst" recording would then be deleted to free up storage space. 

3 HARDWARE OVERVIEW 

Figure 2 is a block diagram that illustrates a computer system 200 upon which an 

20 embodiment of the invention may be implemented. Computer system 200 includes a bus 202 
or other communication mechanism for communicating information, and a processor 204 
coupled with bus 202 for processing information. Computer system 200 also includes a main 
memory 206, such as a random access memory (RAM) or other dynamic storage device, 
coupled to bus 202 for storing information and instructions to be executed by processor 204. 

25 Main memory 206 also may be used for storing temporary variables or other intermediate 
information during execution of instructions to be executed by processor 204. Computer 
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system 200 further includes a read only memory (ROM) 208 or other static storage device 
coupled to bus 202 for storing static information and instructions for processor 204. A storage 
device 210, such as a magnetic disk or optical disk, is provided and coupled to bus 202 for 
storing information and instructions. 
5 Computer system 200 may be coupled via bus 202 to a display 212, such as a cathode 

ray tube (CRT), for displaying information to a computer user. An input device 214, including 
alphanumeric and other keys, is coupled to bus 202 for communicating information and 
command selections to processor 204. Another type of user input device is cursor control 216, 
such as a mouse, a trackball, or cursor direction keys for communicating direction information 
1 0 and command selections to processor 204 and for controlling cursor movement on display 212. 
3 This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a 
y second axis (e.g., y), that allows the device to specify positions in a plane, 
"p. The invention is related to the use of computer system 200 for implementing the 

!"\ techniques described herein. According to one embodiment of the invention, those 
i5 techniques are performed by computer system 200 in response to processor 204 executing 
f~ one or more sequences of one or more instructions contained in main memory 206. Such 
3 instructions may be read into main memory 206 from another computer-readable medium, 
Q such as storage device 210. Execution of the sequences of instructions contained in main 

memory 206 causes processor 204 to perform the process steps described herein. In 
20 alternative embodiments, hard-wired circuitry may be used in place of or in combination with 
software instructions to implement the invention. Thus, embodiments of the invention are 
not limited to any specific combination of hardware circuitry and software. 

The term "computer-readable medium" as used herein refers to any medium that 
participates in providing instructions to processor 204 for execution. Such a medium may take 
25 many forms, including but not limited to, non- volatile media, volatile media, and transmission 
media. Non-volatile media includes, for example, optical or magnetic disks, such as storage 
device 210. Volatile media includes dynamic memory, such as main memory 206. 
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Transmission media includes coaxial cables, copper wire and fiber optics, including the wires 
that comprise bus 202. Transmission media can also take the form of acoustic or light waves, 
such as those generated during radio-wave and infra-red data communications. 

Common forms of computer-readable media include, for example, a floppy disk, a 
5 flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-ROM, any other 
optical medium, punchcards, papertape, any other physical medium with patterns of holes, a 
RAM, a PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a 
carrier wave as described hereinafter, or any other medium from which a computer can read. 
Various forms of computer readable media may be involved in carrying one or more 
1 0 sequences of one or more instructions to processor 204 for execution. For example, the 
; * instructions may initially be carried on a magnetic disk of a remote computer. The remote 
; 1 computer can load the instructions into its dynamic memory and send the instructions over a 
I % telephone line using a modem. A modem local to computer system 200 can receive the data on 
"f! the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An 
vy l 5 infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry 
j a * can place the data on bus 202. Bus 202 carries the data to main memory 206, from which 

□ processor 204 retrieves and executes the instructions. The instructions received by main 

□ memory 206 may optionally be stored on storage device 210 either before or after execution by 
processor 204. 

20 Computer system 200 also includes a communication interface 218 coupled to bus 

202. Communication interface 218 provides a two-way data communication coupling to a 
network link 220 that is connected to a local network 222. For example, communication 
interface 218 may be an integrated services digital network (ISDN) card or a modem to 
provide a data communication connection to a corresponding type of telephone line. As 

25 another example, communication interface 218 may be a local area network (LAN) card to 
provide a data communication connection to a compatible LAN. Wireless links may also be 
implemented. In any such implementation, communication interface 218 sends and receives 
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electrical, electromagnetic or optical signals that carry digital data streams representing 
various types of information. 

Network link 220 typically provides data communication through one or more 
networks to other data devices. For example, network link 220 may provide a connection 
5 through local network 222 to a host computer 224 or to data equipment operated by an 
Internet Service Provider (ISP) 226. ISP 226 in turn provides data communication services 
through the world wide packet data communication network now commonly referred to as 
the "Internet" 228. Local network 222 and Internet 228 both use electrical, electromagnetic 
or optical signals that carry digital data streams. The signals through the various networks 
1 0 and the signals on network link 220 and through communication interface 218, which carry 

□ the digital data to and from computer system 200, are exemplary forms of carrier waves 
y transporting the information. 

i B jft Computer system 200 can send messages and receive data, including program code, 

through the network(s), network link 220 and communication interface 218. In the Internet 

^1 5 example, a server 230 might transmit a requested code for an application program through 
Internet 228, ISP 226, local network 222 and communication interface 218. 

; ^ The received code may be executed by processor 204 as it is received, and/or stored 

□ in storage device 210, or other non- volatile storage for later execution. In this manner, 
computer system 200 may obtain application code in the form of a carrier wave. 

20 

In the foregoing specification, the invention has been described with reference to 
specific embodiments thereof. It will, however, be evident that various modifications and 
changes may be made thereto without departing from the broader spirit and scope of the 
invention. The specification and drawings are, accordingly, to be regarded in an illustrative 
25 rather than a restrictive sense. 
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CLAIMS 

What is claimed is: 



1 1 . A method of providing text-to-speech services, the method comprising the steps of: 

2 splitting a text into segments that include anticipated-content segments and 

3 unanticipated-content segments; 

4 generating speech for said anticipated-content segments based on pre-recorded sound 

5 recordings that correspond to said anticipated-content segments; and 

6 generating speech for said unanticipated-content segments using speech synthesis. 

1 2. The method of Claim 1 comprising the steps of storing usage statistics 

; ^ 2 that identify which segments are contained in texts that are translated 

'1 3 using said text-to-speech services. 

L: 1 3. The method of Claim 2 wherein the usage statistics indicate frequency of use of at 

^ 2 least a set of said segments. 

^ 1 4. The method of Claim 3 wherein: 

□2 the usage statistics indicate frequency of use of unanticipated-content segments; and 

□3 the method includes the step of selecting, based on said usage statistics, a set of 

4 unanticipated-content segments for which to make recordings. 

1 5. The method of Claim 4 wherein the step of selecting a set of unanticipated-content 

2 segments includes selecting a set of unanticipated-content segments that were most 

3 frequently used during a time period. 

1 6. The method of Claim 3 wherein: 

2 the usage statistics indicate frequency of use of anticipated-content segments; and 

3 the method includes the steps of 
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4 selecting a set of anticipated-content segments based on said usage statistics; 

5 and 

6 removing recordings associated with the selected anticipated-content 

7 segments. 

1 7. The method of Claim 6 wherein the step of selecting a set of anticipated-content 

2 segments includes selecting a set of anticipated-content segments that were least 

3 frequently used during a period of time. 

1 8. The method of Claim 1 further comprising the steps of; 

2 recording a plurality of recordings for a particular anticipated-segment; 

: ;5 3 storing data that indicates rules for selecting between said plurality of recordings; and 

4 4 when said text contains said particular anticipated-content segment, applying the rules 

: ji 5 indicated in said data to select one of said plurality of recordings; and 

; T| 6 generating speech for said particular anticipated-segment using said selected 

" 7 recording. 

\Z. 1 9. The method of Claim 8 wherein: 

^ 2 the text is from a particular source; and 

□3 the step of applying the rules includes determining which of said plurality of 

4 recordings to select based at least in part on identity of said particular source. 

1 10. The method of Claim 1 wherein: 

2 the text is from one of a plurality of text sources managed by a plurality of parties; 

3 and 

4 the text-to-speech services are provided by a host, separate from said plurality of 

5 parties, that is connected to said text sources over a network system. 



-15- 

50277-1567 (OID # 2000-132-01) 



1 11. The method of Claim 1 0 wherein the text sources are web pages that contain text, and 

2 said network system is the World Wide Web. 

1 12. The method of Claim 8 wherein; 

2 the anticipated-content segment appears in a particular context within said text; and 

3 the step of applying the rules includes determining which of said plurality of 

4 recordings to select based at least in part on said particular context. 

1 13. A computer-readable medium carrying instructions for providing text-to-speech 

2 services, the instructions including instructions for performing the steps of: 

3 splitting a text into segments that include anticipated-content segments and 
u3 4 unanticipated-content segments; 

I ;3 5 generating speech for said anticipated-content segments based on pre-recorded sound 

Jg 6 recordings that correspond to said anticipated-content segments; and 

C\ 7 generating speech for said unanticipated-content segments using speech synthesis. 

[~ 1 1 4. The computer-readable medium of Claim 1 3 comprising the steps of 

; ^ 2 storing usage statistics that identify which segments are contained in 

O 3 texts that are translated using said text-to-speech services. 

1 15. The computer-readable medium of Claim 14 wherein the usage statistics indicate 

2 frequency of use of at least a set of said segments. 

1 16. The computer-readable medium of Claim 1 5 wherein: 

2 the usage statistics indicate frequency of use of unanticipated-content segments; and 

3 the computer-readable medium includes the step of selecting, based on said usage 

4 statistics, a set of unanticipated-content segments for which to make 

5 recordings. 
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1 1 7. The computer-readable medium of Claim 1 6 wherein the step of selecting a set of 

2 unanticipated-content segments includes selecting a set of unanticipated-content 

3 segments that were most frequently used during a time period. 

1 18. The computer-readable medium of Claim 1 5 wherein: 

2 the usage statistics indicate frequency of use of anticipated-content segments; and 

3 the computer-readable medium includes the steps of 

4 selecting a set of anticipated-content segments based on said usage statistics; 

5 and 

6 removing recordings associated with the selected anticipated-content 
; 3 7 segments. 

1 19. The computer-readable medium of Claim 1 8 wherein the step of selecting a set of 

2 anticipated-content segments includes selecting a set of anticipated-content segments 
: ^ 3 that were least frequently used during a period of time. 

1 20. The computer-readable medium of Claim 13 further comprising the steps of: 

2 recording a plurality of recordings for a particular anticipated-segment; 

□ 3 storing data that indicates rules for selecting between said plurality of recordings; and 

4 when said text contains said particular anticipated-content segment, applying the rules 

5 indicated in said data to select one of said plurality of recordings; and 

6 generating speech for said particular anticipated-segment using said selected 

7 recording. 

1 2 L The computer-readable medium of Claim 20 wherein: 

2 the text is from a particular source; and 

3 the step of applying the rules includes determining which of said plurality of 

4 recordings to select based at least in part on identity of said particular source. 
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1 22. The computer-readable medium of Claim 20 wherein: 

2 the anticipated-content segment appears in a particular context within said text; and 

3 the step of applying the rules includes determining which of said plurality of 

4 recordings to select based at least in part on said particular context. 
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ABSTRACT OF THE DISCLOSURE 

Techniques are provided performing text-to-speech translation in situations in which 
the input texts may contain unanticipated content. According to one aspect of the invention, 
text-to-speech services are provided by splitting a text into segments that include anticipated- 
5 content segments and unanticipated-content segments. Speech for the anticipated-content 
segments is generated based on pre-recorded sound recordings that correspond to the 
anticipated-content segments. Speech for the unanticipated-content segments is generated 
using speech synthesis. Usage statistics are recorded. The usage statistics identify which 
segments are contained in texts that are translated using the text-to-speech services. In one 
10 embodiment, the usage statistics indicate frequency of use of unanticipated-content segments 
f;3 and, based on the usage statistics, a set of unanticipated-content segments for which to make 
Vi recordings is selected. In another embodiment, the usage statistics indicate frequency of use 
\n of anticipated-content segments, and a set of anticipated-content segments is selected based 

on the usage statistics. The recordings associated with the selected anticipated-content 
7'i 5 segments are then removed. 
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Patent 



DECLARATION FOR PATENT APPLICATION 
As a below named inventor, i hereby declare that: 

My residence, post office address and citizenship are as stated below, next to my name. 

I believe I am the original, first, and sole inventor (if only one name is listed below) or an original, first, and 
joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought on the invention entitled 

" ADAPTIVE HOSTED TEXT TO SPEECH PROCESSING" 
the specification of which 

X is attached hereto. 

was filed on as 

United States Application Number , 

or PCT International Application Number , 

and was amended on . 

(if applicable) 

I hereby state that I have reviewed and understand the contents of the above-identified specification, 
including the claims(s), as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information known to me to be material to patentability as defined in 
Title 37, Code of Federal Regulations, Section 1 .56 (copy attached). 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 1 19(a)-(d), on any 
foreign application(s) for patent or inventor's certificate listed below and have also identified below any 
foreign application for patent or inventor's certificate having a filing date before that of the application on 
which priority is claimed: 



Prior Foreiqn Aoplication(s) 






Priority 
Claimed 




(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 


(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 


(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 



I hereby claim the benefit under Title 35, United States Code, Section 1 19(e) of any United States 
provisional application(s) listed below 



(Application Number) (Filing Date) 



(Application Number) 



(Filing Date) 



! hereby claim benefit under Title 35, United States Code, Section 120 of any United States application (s) 
listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in 
the prior United States application in the manner provided by the first paragraph of Title 35, United States 
Code, Section 1 12, 1 acknowledge the duty to disclose all information known to me to be material to 
patentability as defined in Title 37, Code of Federal Regulations, Section 1 ,56 (copy attached) which 
became available between the filing date of the prior application and the national or PCT International filing 
date of this application: 



(Application Number) 


(Filing Date) 


(Status 


- patented, pending, abandoned) 


(Application Number) 


(Filing Date) 


(Status 


- patented, pending, abandoned) 


(Application Number) 


(Filing Date) 


(Status 


- patented, pending, abandoned) 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made 
on information and belief are believed to be true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful false statements may 
jeopardize the validity of the application or any patent issued thereon. 



Full Name of Sole/First Inventor (given namejamily name) JACOB CHRISTFORT 

Inventor's Signature C^_^~ Date ^ rZlS" < 0c> 

Residence San Francisco, California Citizenship Denmark 

(City, State) (Country) 

Post Office Address 1790 Broadway, San Francisco, CA 94109 



Full Name of Second Inventor (given name, family name) 

Inventor's Signature Date 



Residence 

(City, State) 

Post Office Address 



Citizenship 

(Country) 



Full Name of Third Inventor (given name, family name) 



Inventor's Signature Date 



Residence ^ Citizenship 

(City, State) (Country) 

Post Office Address 
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Title 37, Code of Federal Regulations, Section 1 .56 
Duty to Disclose information Material to Patentability 



(a) A patent by its very nature is affected with a public interest. The public interest is best served, 
and the most effective patent examination occurs when, at the time an application is being examined, the 
Office is aware of and evaluates the teachings of all information material to patentability. Each individual 
associated with the filing and prosecution of a patent application has a duty of candor and good faith in 
dealing with the Office, which includes a duty to disclose to the Office all information known to that 
individual to be material to patentability as defined in this section. The duty to disclose information exists 
with respect to each pending claim until the claim is canceled or withdrawn from consideration, or the 
application becomes abandoned. Information material to the patentability of a claim that is canceled or 
withdrawn from consideration need not be submitted if the information is not material to the patentability of 
any claim remaining under consideration in the application. There is no duty to submit information which 
is not material to the patentability of any existing claim. The duty to disclose all information known to be 
material to patentability is deemed to be satisfied if all information known to be material to patentability of 
any claim issued in a patent was cited by the Office or submitted to the Office in the manner prescribed by 
§§ 1 .97(b)-(d) and 1 .98. However, no patent will be granted on an application in connection with which 
fraud on the Office was practiced or attempted or the duty of disclosure was violated through bad faith or 
intentional misconduct. The Office encourages applicants to carefully examine: 

(1) Prior art cited in search reports of a foreign patent office in a counterpart application, and 

(2) The closest information over which individuals associated with the filing or prosecution of a 
patent application believe any pending claim patentably defines, to make sure that any material 
information contained therein is disclosed to the Office. 

(b) Under this section, information is material to patentability when it is not cumulative to information 
already of record or being made of record in the application, and 

(1) It establishes, by itself or in combination with other information, a prima facie case of 
unpatentability of a claim; or 

(2) It refutes, or is inconsistent with, a position the applicant takes in: 

(i) Opposing an argument of unpatentability relied on by the Office, or 

(ii) Asserting an argument of patentability. 

A prima facie case of unpatentability is established when the information compels a conclusion that a 
claim is unpatentable under the preponderance of evidence, burden-of-proof standard, giving each term in 
the claim its broadest reasonable construction consistent with the specification, and before any 
consideration is given to evidence which may be submitted in an attempt to establish a contrary 
conclusion of patentability. 

( c ) Individuals associated with the filing or prosecution of a patent application within the meaning 
of this section are: 

(1) Each inventor named in the application; 

(2) Each attorney or agent who prepares or prosecutes the application; and 

(3) Every other person who is substantively involved in the preparation or prosecution of the 
application and who is associated with the inventor, with the assignee or with anyone to whom there is an 
obligation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing 
information to the attorney, agent, or inventor. 
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Docket No. 50277-1567 
(OID #2000-132-01) 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of 



JACOB CHRISTFORT 



Serial No, Not Yet Assigned 



Group Art Unit: Not Yet Assigned 



Filed: 



Examiner: Not Yet Assigned 



For: 



ADAPTIVE HOSTED TEXT TO SPEECH PROCESSING 



POWER OF ATTORNEY 
AND REVOCATION OF PREVIOUS POWERS 



Honorable Assistant Commissioner 

for Patents 
Washington, DC 20231 

Sir: 

Oracle Corporation, a Delaware corporation having a place of business at 500 Oracle 
Parkway, Box 50P7, Redwood Shores, California 94065, certifies that to the best of its knowledge 
and belief it is the assignee or is entitled to ownership of the entire right, title, and interest in and 
to the above-referenced patent application by virtue of an Assignment filed concurrently herewith 
and represents that the undersigned is a representative authorized and empowered to sign on behalf 
of Oracle Corporation, which hereby revokes all powers of attorney previously given and appoints 
the following attorney(s) and/or agent(s): Edward A. Becker, Reg, No. 37,777; Marcel K. Bingham, 
Reg. No. 42,327; Carl L. Brandt, Reg. No. 44,555; Brian D. Hickman, Reg. No. 35,894; Craig G. 

g Holmes, Reg. No. 44,770; Christopher J. Palermo, Reg. No. 42,056; Carina M. Tan, Reg. No. 

j 45,769 and Bobby K. Truong, Reg. No. 37,499 all of 



Sanjay Prasad, Reg. No. 36,247; and Roger Kennedy, Reg. No. 44,823, of ORACLE 
CORPORATION 

with full power of substitution and revocation, to prosecute this application and to transact all 
business in the Patent and Trademark Office connected therewith. Send all future correspondence to 
the attention of Brian D. Hickman, Reg. No. 35,894, care of the above address and direct all 
telephone calls to the same at (408) 414-1080. 



HICKMAN PALERMO TRUONG & BECKER LLP 
1600 Willow Street 
San Jose, CA 95125 



and 



Dated: U/01/qo 



Assignee of Interest: 

Oracle Corporation 



By: VUJla. 
Name: VUrAOj^/. ^ 




Address of Assignee of Interest: 
— Oracle Cor poration 



500 Oracle Parkway - Q ox 659S07 
Redwood Shores. CA Qdftf* 



Dated: ///f/^O 



1600 Willow Street 
San Jose, CA 95 125 
Telephone: (408) 414-1080 
Facsimile: (408) 414-1076 



Respectfully submitted, 

HICKMAN PALERMO TRUONG & BECKER LLP 



By: 



Name -' Brian D.H^Vman 



Reg.No.:__35 4 894_ 



2 



